Observations on Nonlinear Risk-Sensitive Control
نویسندگان
چکیده
This paper clariies the relationship between risk-sensitive and robust control. This topic has received attention in the recent literature. We show that the formulations are related through an appropriate limit that is more natural than the small noise limit considered previously. The relationship we discuss is more closely related to that which has been shown in the linear systems setting. 1 System Description and Problem Formulation We consider a discrete-time stochastic system de-ned on a nite time horizon, k 2 f0 where at time k, x k 2 X is the state of the system, u k 2 U is the control, y k 2 Y is the observed output, z k 2 < is the controlled output, and where w N?1 =1) are sequences of random variables which represent disturbances to the system. We assume the initial state of the system x 0 has a known probability distribution 0 on the set of states X. The total cost Z incurred in the system's evolution is given by the sum of the controlled outputs at each time, that is: Z = N?1 X k=0 z k (4) The total cost will have a probability distribution induced by the policy , and can be parametrized by the initial distribution 0 on x 0. Based on this distribution , we can consider a number of criteria for the optimal control of the system.
منابع مشابه
An Information-State Approach to Risk-Sensitive Tracking Problems
In this paper we use the information-state approach to obtain solutions to risk-sensitive quadratic control problems. Speci cally we consider the case of tracking a desired trajectory. Results are presented for linear discrete-time models with Gaussian noise, and also for nite-discrete state, discrete-time hidden Markov models with continuous-range observations. These results give insight to mo...
متن کاملRisk Sensitive, Nonlinear Optimal Control: Iterative Linear Exponential-Quadratic Optimal Control with Gaussian Noise
In this contribution, we derive ILEG, an iterative algorithm to find risk sensitive solutions to nonlinear, stochastic optimal control problems. The algorithm is based on a linear quadratic approximation of an exponential risk sensitive nonlinear control problem. ILEG allows to find risk sensitive policies and thus generalizes previous algorithms to solve nonlinear optimal control based on iter...
متن کاملNonlinear Discrete-time Risk-sensitive Optimal Control
This paper is devoted to the study of the connections among risk-sensitive stochastic optimal control, dynamic game optimal control, risk-neutral stochastic optimal control and deterministic optimal control in a nonlinear, discrete-time context with complete state information. The analysis worked out sheds light on the profound links among these control strategies, which remain hidden in the li...
متن کاملRisk - Sensitive Dual Control
In this paper, we develop new results concerning the risk-sensitive dual control problem for output feedback nonlinear systems, with unknown time-varying parameters. These results are not merely immediate specializations of known risk-sensitive control theory for nonlinear systems, but rather, are new formulations which are of interest in their own right. A dynamic programming equation solution...
متن کاملRisk-Sensitive Control and Dynamic Games for Partially Observed Discrete-Time Nonlinear Systems
In this paper we solve a finite-horizon partially observed risk-sensitive stochastic optimal control problem for discrete-time nonlinear systems and obtain small noise and small risk limits. The small noise limit is interpreted as a deterministic partially observed dynamic game, and new insights into the optimal solution of such game problems are obtained. Both the risk-sensitive stochastic con...
متن کاملFinite- dimensional optimal controllers for nonlinear plants
Optimal risk sensitive feedback controllers are now available for very general stochastic nonlinear plants and performance indices. They consist of nonlinear static feedback of so called information states from an information state filter. In general, these filters are linear, but infinite dimensional, and the information state feedback gains are derived from (doubly ) infinite dimensional dyna...
متن کامل